Search CORE

62 research outputs found

High efficiency compression for object detection

Author: Bajic Ivan V.
Choi Hyomin
Publication venue
Publication date: 15/02/2018
Field of study

Image and video compression has traditionally been tailored to human vision. However, modern applications such as visual analytics and surveillance rely on computers seeing and analyzing the images before (or instead of) humans. For these applications, it is important to adjust compression to computer vision. In this paper we present a bit allocation and rate control strategy that is tailored to object detection. Using the initial convolutional layers of a state-of-the-art object detector, we create an importance map that can guide bit allocation to areas that are important for object detection. The proposed method enables bit rate savings of 7% or more compared to default HEVC, at the equivalent object detection rate.Comment: The paper is published in IEEE ICASSP 18

arXiv.org e-Print Archive

Crossref

Can you tell a face from a HEVC bitstream?

Author: Alvar Saeed Ranjbar
Bajic Ivan V.
Choi Hyomin
Publication venue
Publication date: 09/09/2017
Field of study

Image and video analytics are being increasingly used on a massive scale. Not only is the amount of data growing, but the complexity of the data processing pipelines is also increasing, thereby exacerbating the problem. It is becoming increasingly important to save computational resources wherever possible. We focus on one of the poster problems of visual analytics -- face detection -- and approach the issue of reducing the computation by asking: Is it possible to detect a face without full image reconstruction from the High Efficiency Video Coding (HEVC) bitstream? We demonstrate that this is indeed possible, with accuracy comparable to conventional face detection, by training a Convolutional Neural Network on the output of the HEVC entropy decoder

arXiv.org e-Print Archive

Crossref

De-sketching

Author: Ivan V. Bajic
Lior Bragilevsky
Publication venue
Publication date: 22/06/2018
Field of study

Many software applications exist for plotting graphs of mathematical functions, yet there are none (to our knowledge) that perform the inverse operation - estimating mathematical expressions from graphs. Since plotting graphs (especially by hand) is often referred to as "sketching," we refer to the inverse operation as "de-sketching." As the number of mathematical expressions that approximate a given curve can be quite large, in this demo we restrict our attention to polynomials, and present a deep model that performs de-sketching by finding the best second-degree polynomial to fit the curve in the input image. Currently, our trained model is able to provide reasonably accurate estimates of polynomial coefficients for both synthetically-generated and hand-drawn curves

Simon Fraser University Institutional Repository

The Data Gap in Sports Analytics and How to Close It

Author: Bajic Ivan V.
Harell Alon
Publication venue
Publication date: 19/11/2019
Field of study

As the importance and prevalence of sports analytics grows, so does the inequality in sports data. In this paper we examine two main sources of such disparity - the perceived hierarchy of sports and privatization of data. We argue that such inequality hurts the sports analytics community in the short and long terms, and suggest ways for the deep-learning, AI, and sports analytics communities to help mitigate the issue. Keywords: Sports Analytics; AI; Team Sports; Diversit

Simon Fraser University Institutional Repository

Online MoCap Data Coding with Bit Allocation, Rate Control, and Motion-Adaptive Post-Processing

Author: Bajic Ivan V.
Kwak Choong-Hoon
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

With the advancements in methods for capturing 3D object motion, motion capture (MoCap) data are starting to be used beyond their traditional realm of animation and gaming in areas such as the arts, rehabilitation, automotive industry, remote interactions, and so on. As the amount of MoCap data increases, compression becomes crucial for further expansion and adoption of these technologies. In this paper, we extend our previous work on low-delay MoCap data compression by introducing two improvements. The first improvement is the bit allocation to long-term and short-term reference MoCap frames, which provides a 10-15% reduction in coded bitrate at the same quality. The second improvement is the post-processing in the form of motion-adaptive temporal low-pass filtering, which is able to provide another 9-13%savings in the bitrate. The experimental results also indicate that the proposed online MoCap codec is competitive with several state-of-the-art offline codecs. Overall, the proposed techniques integrate into a highly effective online MoCap codec that is suitable for low-delay applications, whose implementation is provided alongside this paper to aid further research in the field

Crossref

Simon Fraser University Institutional Repository